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Noise mechanisms in quantum systems can be broadly characterized as either coherent (i.e., unitary) or inco¬ 
herent. For a given fixed average error rate, coherent noise mechanisms will generally lead to a larger worst-case 
error than incoherent noise. We show that the coherence of a noise source can be quantified by the unitarity, 
which we relate to the average change in purity averaged over input pure states. We then show that the unitarity 
can be efficiently estimated using a protocol based on randomized benchmarking that is efficient and robust 
to state-preparation and measurement errors. We also show that the unitarity provides a lower bound on the 
optimal achievable gate infidelity under a given noisy process. 


To harness the advantages of quantum information process¬ 
ing, quantum systems have to be controlled to within some 
maximum threshold error. Certifying whether the error is be¬ 
low the threshold is possible by performing full quantum pro¬ 
cess tomography [1, 2], however, quantum process tomogra¬ 
phy is both inefficient in the number of qubits and is sensitive 
to state-preparation and measurement errors (SPAM) [3], 

Randomized benchmarking [4-9] and direct fidelity esti¬ 
mation [10, 11] have been developed as efficient methods for 
estimating the average infidelity of noise to the identity. How¬ 
ever, the worst-case error, as quantified by the diamond dis¬ 
tance from the identity, can be more relevant to determining 
whether an experimental implementation is at the threshold 
for fault-tolerant quantum computation [12], The best possi¬ 
ble bound on the worst-case error (without further information 
about the noise) scales as the square root of the infidelity and 
can be orders of magnitude greater than the reported average 
infidelity [13, 14], 

However, this scaling of the worst-case bound is only 
known to be saturated by unitary noise. If the noise is known 
to be stochastic Pauli noise, the worst-case error is directly 
proportional to the average infidelity [9], vastly improving on 
the general bound. Consequently, quantifying the intermedi¬ 
ate regime between unitary and fully incoherent noise may 
allow the bound on the worst-case error to be substantially 
improved. 

Randomized benchmarking is also emerging as a useful 
tool for diagnosing the noise in an experiment [15, 16], which 
can then be used to optimize the implementation of gates by 
varying the experimental design. In this spirit, an experi¬ 
mental protocol for characterizing the coherence of a noise 
channel will be an important tool as the quest to build a fault- 
tolerant quantum computer progresses. 

In this paper, we present a protocol for estimating a particu¬ 
lar quantification of the coherence of noise, which we term the 
unitarity, in the experimental implementation of a unitary 2- 
design. Our protocol is efficient and robust against SPAM, 
and is a minor modification of randomized benchmarking. 
The unitarity is defined as the average change in the purity 
of a pure state after applying the noise channel, with the con¬ 
tributions due to the identity component subtracted off, (see 
Eq. (4)) and is closely related to the purity of the Jamiolkowski 
isomorphic state (see Proposition 9). We show that the uni¬ 


tarity is invariant under unitary gates and attains its maximal 
value if and only if the noise is unitary. Furthermore, we show 
that the unitarity can be combined with the average gate fi¬ 
delity to quantify how far a noise channel is from depolarizing 
noise. Finally, we show that the unitarity of a noise channel 
provides a lower bound on the best achievable gate infidelity 
assuming perfect unitary control. 

Our approach to quantifying coherence complements other 
recent work on quantifying coherence since we focus on the 
coherence of quantum operations rather than the coherence of 
quantum states relative to a preferred basis [17]. 


I. DEFINING UNITARITY 

We begin by defining the unitarity of a noise channel 8 : 
B(C d ) —> B(C d ), that is, a completely positive (CP) linear 
map that takes quantum states to quantum states. The purity 
of a quantum state p is Tr p' p £ [0,1] with Trp^p = 1 if and 
only if p is a pure state. An initial candidate for a definition of 
the unitarity of 8 is 

I dipTr[8(ipy8(tp)\, ( 1 ) 

that is, as the purity of the output states averaged over all pure 

state inputs. However, this definition is problematic, since it 
would lead to the nonunital state-preparation channel 

8 o (p) = Tr(p)\0)(0\ (2) 

having the same value of unitarity as a unitary channel, even 
though it does not preserve coherent superpositions. Simi¬ 

larly, the (trace-decreasing) filtering channel 

£ 1 (p) = |0)(0|p|0)(0| (3) 

does not preserve coherent superpositions and so should have 
the same unitarity value as a complete depolarizing channel. 
Both of these problematic channels arise when either the iden¬ 
tity is mapped to coherent terms or vice versa. 

To avoid these issues, we define the unitarity of a noise 
channel to be the average purity of output states, with the iden¬ 
tity components subtracted, averaged over all pure states. That 
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is, we define 

u {£) = J dipTr [£ 1 (ip)* 1 £', ( 4 ) 

where the normalization factor is chosen so that u(I) = 1 and 
£' is defined so that £'{A) = £(A) — \Tr£{A) /\fd\l for all 
traceless A (to account for trace-decreasing channels, such as 
in Eq. (2)) and £'(ld) = 0 (to account for non-unital chan¬ 
nels, such as in Eq. (3)). Equivalently, if {A 2 ,... ,A d 2 } is 
any set of traceless and trace-orthonormal operators and with 
Ai = 1/d (e.g., the normalized Paulis), then we can define 
the generalized Bloch vector n (p) of a density operator p with 
unit trace to be the vector of d 2 — 1 expansion coefficients 

p= l d /d + ^2,n k A k . (5) 

k> 1 

Our definition of the unitarity is then equivalent to 

u(£) = J <hp\\n[£ (ip)]-n[£(l d /d)}\\ 2 , ( 6 ) 

that is, the average squared length (i.e.. Euclidean norm) of 
the generalized Bloch vector after applying the map £ with 
the component due to the identity subtracted off. 

II. THE ESTIMATION PROTOCOL 

We now present a protocol for characterizing the unitarity 
of the noise in an experimental implementation of a unitary 
2-design Q under the assumption that the experimental imple¬ 
mentation of any U £ Q can be written as U o £ where U 
denotes the channel corresponding to conjugation by U and £ 
is a completely positive, trace-preserving (CPTP) channel in¬ 
dependent of U. (Note that, as in all randomized benchmark¬ 
ing papers, the assumption that £ is independent of U can be 
relaxed without dramatically effecting the results [8, 13, 16].) 

The protocol is to repeat many independent trials of the fol¬ 
lowing. 

• Choose a sequence j = (j 1 ,... ,j m ) of m integers in 
N|p| = {1,..., |£7|} uniformly at random. 

• Estimate the expectation value Q- } of an operator Q after 
preparing the state p and applying the sequence Uj = 
Ujm Uj m _ 1 • ■ ■ Uj 1 of operators. In the ideal case that 
£ =1, the expectation value is given by 

Qj = Tr(QU iP Uj). (7) 

We will show in Sec. IV, Theorem 5 that, under the above 
assumptions on the noise, the expected value of Qj over all 
random sequences j obeys 

Ej [Q?}= A + Bu^- 1 (8) 

for trace-preserving noise, where A and B are constants incor¬ 
porating SPAM and the nonunitality of the noise and u(£) £ 


[0,1] is the unitarity of the noise defined in Eq. (4), with 
u(£ ) = 1 if and only if £ is unitary. 

Therefore estimating Ej [Qj] for multiple values of m using 
the above protocol and fitting to Eq. (8) gives an efficient and 
robust estimator of the unitarity. 

A. Estimators 

Note that, as opposed to standard presentations of random¬ 
ized benchmarking, we are considering the expectation of an 
operator Q rather than the probability of a single outcome. 
Though these two descriptions are mathematically equivalent, 
by presenting in terms of observables, we more easily take 
expectations over multiple observables. For example, we can 
average over the non-identity Pauli operators, while keeping 
the sequence the same. As will be discussed in Sec. IV, this 
allows us to simulate a two-state measurement involving S, 
the SWAP operator. We term this the purity measurement , as 
it estimates the relevant state-dependent term in 

TrCPj) = \ + ll n 0.i)l| 2 > (9) 

the purity of the state p- } produced by the sequence j. What we 
actually use is a shifted and rescaled version of this defined by 

p i = j^-yll n (Pj)ll 2 , (10) 

which for physical states is always in the interval [0,1]. For a 
single qubit, and measuring in the Pauli basis, this quantity is 
just Pj = (A') 2 + ( Y ) 2 + ( Z ) 2 , where each expectation value 
is taken with respect to the state p- } . 

The purity measurement can be performed in one of two 
ways. The direct way involves using two copies of the exper¬ 
iment (with the same sequence) that are run in parallel and 
a SWAP gate applied immediately prior to measurement. A 
method using only one copy makes use of the expansion 

S = Y J A k ®A\ ( 11 ) 

k 

for any orthonormal operator basis { A k } (e.g., the normal¬ 
ized Paulis) by adding up the expectation values over mea¬ 
surements in the operator basis for the same sequences, that 
is, by estimating J2 k 

Implementing the purity measurement using this averag¬ 
ing reduces the between-sequence contribution to the uncer¬ 
tainty in our estimates of Ej[Qj], since if the noise is approx¬ 
imately unitary, then the final state will be relatively pure but 
will generically overlap with all non-identity Paulis. We note, 
however, that the above summation over a trace-orthonormal 
basis is not scalable with the number of qubits, since there are 
exponentially many ?i-qubit Paulis. We leave possible opti¬ 
mizations and an analysis of the scalable two-copy protocol 
as an open problem. 

Also note that unlike in standard randomized benchmark¬ 
ing, we do not require the unitary 2-design to be a group since 
we do not require an inverse operation, or even that the set Q 
is closed under composition. 
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B. Trace-decreasing noise 

More generally, some experimental noise £ may be trace- 
decreasing with an average survival rate 

S(£) = J d'0Tr[£('i/>)], (12) 

which is the amount of the trace of the quantum state 'd> that 
survives the error channel £, averaged over the Haar measure 
dtp. When £ is itself the average noise over Q, the average 
loss rate can be estimated by 

(13) 

where C is a constant determined by SPAM [16], 

For trace-decreasing noise, the standard decay curve in 


Eq. (8) can be generalized to 


Ej[Qj] = AA™ -1 + .BA™ -1 , 

(14) 

for some constants A and B where 


A + + A_ =S{£) 2 + u{£). 

(15) 


The above protocol is a variation of standard randomized 
benchmarking experiments, and is very similar to the protocol 
for estimating loss presented in Ref. [16], In particular, one 
estimates an exponential decay rate in an exactly analogous 
manner (see Eq. (8)) and the result is obtained in a manner 
that is robust to SPAM. 

However, there are three small but crucial differences to the 
experimental protocol presented in Ref. [16], leading to sig¬ 
nificant differences in the analysis and interpretation of the de¬ 
cay curves. Most importantly, the post-processing is different, 
since in the present paper the survival probabilities for the in¬ 
dividual sequences are squared before they are averaged. Sec¬ 
ondly, the preparation and measurement procedures in the loss 
protocol of Ref. [16] are ideally the maximally mixed state 
and the trivial (identity) measurement respectively, which is 
out performed in this protocol by the use of the purity mea¬ 
surement. Finally, the current protocol requires a unitary 2- 
design, whereas the loss protocol only requires a unitary 1- 
design (although it also works for a unitary 2-design). 


III. NUMERICAL SIMULATIONS 


We now illustrate our model and our experimental protocol 
by numerically simulating each for a variety of single-qubit 
noise models. In Fig. 1, we give an example of the correctness 
of our model Eq. (8) by showing that it agrees with simulated 
experimental data in the extreme case that the error channel is 
a fixed unitary. In Fig. 2, we demonstrate the utility of Eq. (8) 
as an estimation model by estimating u(£) from simulated 
data drawn according to our protocol. We simulate measure¬ 
ment error on each measurement with small independent ran¬ 
dom orthogonal matrices, scaling the unital components with 
a random factor between 0.95 and 1.0. In both these simula¬ 
tions we simulate SPAM on the prepared state by applying a 
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FIG. 1. (Color online) Comparison of our model (Eq. (8)) with sim¬ 
ulated data, where the error channel is taken to be a fixed unitary 
drawn from the Haar measure (blue), or given by a rotation of 0.1 
about a x (red). In the simulated data, the purity is measured follow¬ 
ing the formula of Eq. (10). The solid lines show the model Eq. (8) 
for the calculated values of A, B and u (£) for each scenario. This 
shows that even for extreme values of unitary noise, our model cor¬ 
rectly predicts the unitarity. 


random near-identity unitary. We choose Q to be the single¬ 
qubit Clifford group. 

Concretely, in Fig. 1 we show two runs. In the first we set 
£ to be some fixed (systematic) unitary chosen randomly ac¬ 
cording to the Haar measure ( Haar-random unitary) and some 
near-identity unitary represented by a rotation of 0.1 radians 
around the A-axis of the Bloch sphere ( near-identity unitary). 
As Fig. 1 demonstrates, our model is in indeed insensitive to 
unitary noise. 

In Fig. 2, we show different types of unital noise composed 
with the nonunital amplitude-damping channel 

£d(p) =p|0)(0| + (1 -p)p (16) 

to simulate relaxation to a ground state. The particular unital 
channels we consider are a Haar-random unitary and a gate- 
dependent noise channel corresponding to choosing a fixed 
perturbation of the eigenvalues of a unitary g by e lf to simu¬ 
late over/under-rotation errors, where the perturbations e are 
chosen independently and uniformly from [—0.1,0.1] radians 
for each gate (rotation channel). 

Note that the statistical fluctuations in Figs. 1 and 2 arise 
from between-sequence variations and within-sequence varia¬ 
tions. The between-sequence variations arise from sampling a 
small number of random sequences (30 sequences in this case) 
relative to the total number. The between-sequence variations 
are minimised by measuring an observable for the purity. A 
perturbation expansion of the form £ = I—rS (where r is the 
average gate infidelity of £ to the identity) together with ap¬ 
propriate bounds on the diamond norm can be used to bound 
these fluctuations and show that they must decrease with gate 
infidelity, as in Ref. [13], However, a more detailed analysis 
is complicated by the complexity of the relevant representa¬ 
tion theory (that is, four-fold tensor products). The within- 
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FIG. 2. (Color online) Numerical purity decay curve for our protocol 
when the noise consists of the non-unital channel in Eq. (16) with 
p = 0.003 composed with: a Haar-random unitary (black crosses), 
and fixed gate-dependent over/under-rotations where the eigenval¬ 
ues for each gate are perturbed by e ±tS for some S G [—0.01, 0.01] 
chosen independently and uniformly (red pluses). The purity is 
measured in the same manner as in Fig. 1. The third plot (blue 
stars) shows (gate-dependent) over/under-rotations composed with 
the nonunital channel with p=0.01. The solid lines give the fit to 
Eq. (8), where the slope gives an estimate for u[£) given by the val¬ 
ues 0.994, 0.993 and 0.978, consistent with the theoretical values of 
0.994, 0.994 and 0.981, respectively. Confidence intervals for u (£) 
were calculated assuming Gaussian noise, such that the least-squares 
fit is a maximum likelihood estimator. 


FIG. 4. (Color online) Unitarity of single-qubit CPTP channels cho¬ 
sen according to the random distributions of Bruzda et al. [18], plot¬ 
ted versus their fidelities to the identity channel. This example shows 
that even though the two quantities are correlated, they are not redun¬ 
dant and give different insight into the structure of the noise. 


QuTiP software package [19] to draw channels and com¬ 
pute their unitarity (see Supplemental Material). As shown 
in Fig. 3, the distribution of unitarities depends strongly on 
the Kraus rank of the random channel. Moreover, as demon¬ 
strated in Fig. 4, this information is correlated with, but dis¬ 
tinct from, the average gate fidelity. 



Unitarity u(£) 

FIG. 3. (Color online) Unitarity of single-qubit CPTP channels cho¬ 
sen according to the random distributions of Bruzda et al. [18] with 
varying ranks of the Kraus operators, demonstrating that the unitarity 
carries information about the structure of the channel. In particular, 
channels which require more Kraus operators to specify tend towards 
much smaller unitarity. 


sequence variations arise from the need to estimate the expec¬ 
tation values of the observables. For the purpose of Figs. 1 and 
2, we used an unbiased estimator of the squared expectation 
values, simulating N measurements (with N set to 150). 

Finally, we consider the unitarity of random channels drawn 
from the random ensemble of Bruzda et al. [18], using the 


IV. DERIVATION OF THE FIT MODELS 

We now derive the decay curve in Eq. (14) for trace-non- 
increasing noise and show how the decay curve in Eq. (8) 
emerges as a special case for trace-preserving noise. 

Since we are dealing with sequences of channels, it will be 
convenient to work in the Liouville representation. Since a 
quantum channel is a linear map between finite-dimensional 
vector spaces, it is always possible to represent it as a matrix 
acting on basis coefficients in some given bases for the vector 
spaces. 

In order to construct the Liouville representation of chan¬ 
nels, let ^4 = {Ai,..., A d 2 } be an orthonormal basis of C dxd 
according to the Hilbert-Schmidt inner product (A, B) = 
Tr A^B. Any density matrix p can be expanded as p = 
SfeGN o(Ak,p)Ak and so we can identify p with a column 

vector Ip) G C d whose fcth entry is ( Ak,p ). The Liou¬ 
ville representation of a channel £ is then the unique ma¬ 
trix £ G C d xd such that £\p) = |£[p]), which has en¬ 
tries £ki = ( Ak,£(Ai )) = (Ak\£\Ai). An immediate con¬ 
sequence of the uniqueness of £ is that the composition of 
abstract maps is represented in the Liouville representation by 
matrix multiplication. 

The Liouville representation of unitary channels forms a 
unitary projective representation of the unitary group U(d). 
When we wish to emphasize the Liouville representation as a 
formal representation (rep) of the unitary group U(d) (or sub¬ 
groups thereof), we will use the notation 4>l{U) instead of if. 
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With this notation, it is easy to verify that <f>L is indeed a uni¬ 
tary representation of U(d), since the Liouville representation 
of composition is matrix multiplication and it can easily be 
verified that 4>l(U^) = <piXU)f 

Any representation (f> of a semisimple group Q [such as 
SU(d)] over a vector space V can be unitarily decom¬ 
posed into a direct sum of irreducible representations (irreps) 
(f>i (g) l ni , where the l label the irreps and the rq are the 
corresponding multiplicities and a rep 0 over a vector space 
V is irreducible if there are no nontrivial subspaces of V that 
are invariant under the action of </>. A particularly important 
irrep for this paper is the trivial irrep fx such that frig) = 1 
for all g € Q. 

In the Liouville representation, vectors b £ C d are in one- 
to-one correspondence with operators B £ C d , so invari¬ 
ant (vector) subspaces under the Liouville representation can 
be identified with operator subspaces that are invariant under 
conjugation in the canonical (i.e., d x d matrix) representa¬ 
tion. In particular, the identity operator 1 is invariant under 
conjugation by any unitary, so |1) is an invariant subspace of 
the Liouville representation corresponding to a trivial irrep. 
We now fix Ai = 1/y/d (so that (A\, Ai) = 1), so that the 
Liouville representation of any unitary U is 

4>l(.U) = 1 © 4>u{U) (17) 

where © denotes the matrix direct sum and we refer to <j> u (U) 
as the unital irrep , which has dimension d 2 — 1. Furthermore, 
any CP channel £ can be written in a corresponding block 
form as 

where we refer to £ s di, £ n and £ u as the state-dependent leak¬ 
age, nonunital and unital blocks respectively. We now show 
how £ u is related to the definition of the unitarity in Eq. (4). 

Proposition 1. The unitarity of a channel £ is 

n(£) = ^-jTr£l£ u (19) 

Proof. For any operator A, Tr A* 1 A = (A | A ) and £' = 
P U £P U where P u is the projector onto the unital irrep, so 
Eq. (4) can be rewritten as 

u{£) = 

= j^Tr£{£ u O , ( 20 ) 

where O = f with the slight abuse of notation 

£ — P £P 

U — ± U c ' 1 u* 

Since O commutes with the action of the unitary group, 
Schur’s lemma implies that it is a weighted sum of projectors 
onto the irreps of fx, 

(21) 


The projector onto the trivial irrep is Pt = A 1 ) (A -1 and so 

= A T =J d^ilVOI 2 = (22) 

Because Tr O = 1 from the normalization of the Haar mea¬ 
sure, we can solve for A u in the expression 

TrO = 1 = gl) + A u (d 2 - 1) (23) 

and we find A u = 1 /d(d + 1). Plugging this in and using 
P u Pt = 0 gives the final result. □ 

Before we derive the decay curve in Eq. (14) using the ex¬ 
pression for the unitarity from Proposition 1, let us first sim¬ 
plify the quantity of interest. The expectation value of Q given 
that the sequence j was applied is 

Qi = iQ\Uj m £ ■ ■ ■ Uj 2 £ Uji \p)> (24) 

where a residual noise term has been absorbed into the exper¬ 
imental state preparation p. Noting that 

q2 = (q® 2| M ®2f®2 _ _, U®^£® 2 U®^\p® 2 ), (25) 

the expected average of the squares is 

Ej[Qf] = i(?r m EQj 

j 

= (Q® 2 1 (w®4£® 2 w®= E" 1 |p® 2 ) 

= (Q® 2 |Al m_1 |p® 2 ), (26) 

where W ®? g = \Q\ 1 Y^g^g 9® 2 - we define the averaged op¬ 
erator A4 = Uf 2 a £' /yI Uf‘ 2 , and we have used the fact that 
\G\ YZg^g $( 9 ) i s the projector onto the trivial subreps for 
any rep f of a group Q so that W® 2 g = (W® v 2 g ) 2 [20]. Thus, to 
derive the fit model we must first identify the trivial irreps of 
Q in since this is where Ai is supported. 

Proposition 2. The averaged operator Ai = U ® 2 g £ ® 2 U ®it 
is supported on a two-dimensional subspace spanned by H^ 2 ) 
and | S'), where S is the SWAP operator. 

Proof. Define \n((j) = Tr R(g) as the character of the 
rep R. Then we can use Schur’s orthogonality relations 
to count the number of trivial irreps. Let ((xRiXR')) = 
ISP 1 Y2 g eg Xr(9)XR 1 ( 9 ) denote the character inner product 
for Q. From the direct sum structure in Eq. (17), the number 
of trivial irreps is 

((Xi) Xl®l)) = ((Xii Xi + 2Xu + Xu)) = 1 + 0 + ((xi) Xu)) • 

Since Xu is real-valued, we have ((xi, xl)) = ((Xu, Xu))- If Q 
acts irreducibly on the unital block [21], then ((XuiXu)) = 1 
and the number of trivial irreps is 2. 

The two trivial irreps in 0® 2 are spanned by the orthonor¬ 
mal vectors \Bi) and IB 2 ) where 

£t = Wd , B 2 = (S-t d 2/d)/y/d 2 -l , (27) 


O = XtPt + A U P U . 
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and S is the SWAP operator. To check this, note that 

OB,|<M£O 02 |5fc) = Tr {B]U® UB k U t 0 17*) = S jk , 

since both identity and SWAP are invariant under conjugation 
by U 0 U and S 2 = 1 ^ 2 . Since <j>i((7)® 2 is a unitary rep, B\ 
and B 2 are the first two elements of a two-qudit orthonormal 
Schur basis { Bj } for (iff 2 and so correspond to trivial irreps. 
Therefore AT is zero except for the 2x2 submatrix supported 
on \B\) and |£> 2 )- These vectors have the same span as 11^2) 
and | S). □ 

The next proposition characterizes the averaged operator on 
the supported subspace. 

Proposition 3. In the invariant Bi basis from Eq. (27), the 
averaged operator AT has the following matrix elements 

• Mu = S(£) 2 , 

• AT12 = (T 2 - l) - 1 / 2 H£ s di|| 2 , 

• AT 2 i = (d 2 - l) -1 / 2 ||£ n || 2 , and 

• AT 2 2 = (T 2 - 1) -1 ||£ u ||f = u{£). 

Proof. We will establish the matrix elements with respect to 
11 d 2 ) and |5), and the claims about the Bi basis will follow 
by taking appropriate linear combinations. 

Because the Bi basis is invariant, we can ignore the average 
unitary terms in AT. We first find that 

(l d 2|5® 2 |l d 2) = Tr£® 2 (l d2 ) = Tr(£(l d )® 2 ) 

= (Tr£(l d )f = d 2 S{£) 2 . (28) 


Proof. Since the averaged operator vanishes almost every¬ 
where, we only need to consider the 2x2 submatrix derived 
above. The nontrivial eigenvalues are 

A± = -ATn + -AT 22 

± \ \![ATn - AT 22 ] 2 + 4 ATi 2 AT 2 i . (31) 

This spectrum is degenerate precisely when the terms under 
the square root both vanish (since both terms are nonnegative). 
Whenever the spectrum is nondegenerate, there are trivially 
two distinct eigenvectors, so we only need to deal with the 
degenerate case. 

We will break the analysis for the degenerate spectrum into 
two nontrivial cases, ATn = AT 22 and either AT 12 = 0 or 
AT 21 = 0, exclusive. There are also two trivial cases: when 
AT 12 = AT 21 = 0, the matrix AT is already diagonal and 
we are done. We ignore the pathological case when ATn = 
0, since this corresponds physically to a state that is never 
observable. In both nontrivial cases, we will make use of the 
two-qudit state II a = gpErjy, the maximally mixed state on 
the antisymmetric subspace. Expanding this state in the Bi 
basis gives 

|n„) = ttiIBi) + n 2 \B 2 ) = 1| Bi) - B 2 ). 

The key feature of this state is that 7r 2 < 0. 

Case 1: AT 12 = 0. In this case, 


AT = 



(32) 


where A > 0 and y > 0. Taking the mth power gives 


Next we can use the identity ( S, A 0 B) = Tr [S(A 0 £?)] = 
Tt(AB) and the fact that £(A*) = £(A)* to find 

(S|£® 2 |l d 2 ) = Tr[S£(l d )® 2 ] = Tr[£(l d )£(l d )] 

= TV[£(l d )t£(l d )] =\\£(l d )\\ 2 F , (29) 

where || • || denotes the Frobenius norm. The expression 
for (l d 2 |£ G2 | S) follows similarly using the adjoint channel. 
Finally, we can use the expansion S = ffi, T k 0 A\. for any 
orthonormal operator basis A k to obtain 

(S|£® 2 |S) = Tr[55® 2 (5)] =^Tr[5£(A fc )0f(4)] 

k 

= Y, Tr[£(A k ?£(A k )] = \\£\\ 2 F . (30) 

k 

The values of the matrix elements are then established by us¬ 
ing the form of Eq. (18) and the definition of the Bi basis from 
Eq. (27) and taking various linear combinations. □ 

The final step in deriving the fit model is to analyze the 
eigenvalues and eigenvectors of the averaged operator. 

Proposition 4. The averaged operator AT has two distinct 
nontrivial eigenvectors. 


AT™ = A™" 1 f A ? V (33) 

\ my A J 

If we perform the measurement {II a , 1 — n o } on a system 
prepared in the state l^/d 2 which evolves under AT™, then 
the probability of observing the outcome II a is 

\m —1 

(n a |M m |l d 2 /d 2 ) = —T— (A7 Ti + my7r 2 ). (34) 

a 

Since A, 7Ti > 0, y > 0, and 7r 2 < 0, in order for this to be a 
probability for all to, we require y = 0 and so AT is actually 
diagonal. 

Case 2: AT 2 i = 0. In this case, 

m =(oa)- <35) 

where A > 0 and y > 0. Taking the mth power gives 

AT m = A ™ -1 ^ A Y ) • < 36 ) 

Therefore the probability of detecting the system (i.e., mea¬ 
suring l,/ 2 ) when a system is prepared in the state II a and 
evolves under AT™ is 

(l d 2 1 AT™ |n a ) = A ™- 1 (ATT! + myirf) . (37) 
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Again since A,7 Ti > 0, y > 0, and -k 2 < 0, for this to be a 
valid probability for all m, we require y = 0 and so Ad is 
actually diagonal. □ 

We now have all the ingredients to derive the fit models of 
Eqs. (8) and (14). 

Theorem 5. For time- and gate-independent noise, the ex¬ 
pected value Ej[Qf] obeys the decay equation 

Ej [Q?}=A + Bu (£) m ~ 1 

for trace-preser\’ing noise, and for trace-decreasing noise it 
obeys 


Ej[Qj] = AX’?- 1 + BX™- 1 , 

where A± are given by Eq. (31), A++A_ = S(£) 2 +u(£), and 
the constants A and B depend only on state preparation and 
measurement errors and the unitary that diagonalizes A A. 

Proof. Proposition 4 establishes that the matrix A4 is diag- 
onalizable by a similarity transform with eigenvalues given 
by Eq. (31). From Eq. (26), we can diagonalize Ad and ab¬ 
sorb the similarity transform into |p® 2 ) and (Q® 2 | as SPAM, 
yielding 


Ej[Qj 2 ] = (Q® 2 |M m "V 2 ) 

= AA”- 1 + BX ™- 1 . (38) 

Trace-preserving noise is a special case of this, since if £ is 
TP, then by Proposition 3 we have A + = S(£) 2 = 1 and so 
A_ = u(£). □ 

We note that the unitary that diagonalizes Ad. will in gen¬ 
eral depend on the noise channel, and hence will depend on 
u. We conflate this dependence with the SPAM errors in our 
fit model, as the diagonalization of Ad does not depend on 
the sequence length m. Neglecting the dependence on u thus 
results in a model that is correct, but is slightly less sensitive 
to u than is optimal. 

We are now equipped to formalize the observation made in 
Sec. II that the optimal observable for trace-preserving noise 
is an operator proportional to B 2 . This follows from noting 
that such operators overlap fully with the component of Ad 
that give rise to the exponential term, as given by Eq. (30). 


V. PROPERTIES OF THE UNITARITY 

We now prove some properties of the unitarity for CPTP 
channels that make it a practical quantification of the coher¬ 
ence of a channel. We begin by proving that the unitarity 
and the average incoherent survival probability can be used to 
bound the nonunital and state-dependent leakage terms which 
are subtracted off in the definition of unitarity in Eq. (4). 

Proposition 6. For any channel £, 

max{||£ n || 2 , ||£ sdl || 2 } < \(d 2 - 1)[S (£) 2 - u(£)]. (39) 

If £ is trace-preserving, then ||£ n || 2 < (d — 1)[1 — u(£)\. 


Proof. Consider the maximally mixed states on the symmetric 
and antisymmetric subspaces, II S = and II a = 

respectively, and let E s and E a be the respective projectors 
onto these spaces. Expanding these states in the Bi basis gives 


|n s ) 

|H a) 


\Bi) 




Vd^l 
d(d + 1) 
Vd^l 
d(d- 1) 


I b 2 ) 

I b 2 ). 


(40) 


Preparing the state II S (II a ), evolving under A4 and then mea¬ 
suring the POVM {E a ,l — E a } ({E s ,l — E s }) produces the 
outcomes E a ( E s ) with probabilities 


Pas = (K|A4|n s ) 


d—1 


2 d 

Psa = (^ S |AT|n a ) 

d + 1 
2 d 


u(£) - 

l|£n|| 2 , 

llfsdlll' 

d-1 + 

d + 1 

u(£) + 

l!£n|| 2 

II^BdlU 

d + 1 

d—1 


(41) 


(42) 


respectively, where we have used Proposition 3. Since both 
these expressions are probabilities we have 0 < p as < 1 and 
0 < Psa < 1. Taking appropriate linear combinations of these 
two inequalities will cancel the dependence on either |£, || 2 
or ||£ s di|| 2 , isolating the other variable. Simplifying the re¬ 
sulting expressions gives the bound Eq. (39) for both quanti¬ 
ties individually, hence the maximum holds as well. 

Furthermore, if the noise is trace-preserving, then 
||£sdi|| 2 = 0 and S(£) = 1, so p as > 0 gives ||£„|| 2 < 
(d — 1) [1 — u(£)\ for trace-preserving noise. □ 

We now prove that u(£) = 1 if and only if £ is unitary and 
that u(£) is invariant under composition with unitaries. 

Proposition 7. For any channel £, u(£) < 1 with equality if 
and only if £ is unitary. Furthermore, the unitarity satisfies 
u(V o £ o If) = u(£) for any unitaries U,V £ U ( d ). 

Proof. The unitary invariance u(V o £ o U) = u(£) follows 
immediately from the invariance of the trace under cyclic per¬ 
mutations. 

Since the norms of vectors are always nonnegative, u(£) = 
1 only if £ is trace-preserving and unital by Eq. (39), in which 
case the adjoint channel £, '' is also a channel [22] and so the 
eigenvalues of S ' £ (i.e., the singular values of S) are all 
bounded by one [23], Therefore u(£) = 1 only if £ is uni¬ 
tal and all the eigenvalues of S have unit modulus and con¬ 
sequently if |det£| = 1. However, the only channels with 
|det£| = 1 are unitary channels [24], Since u(£) is unitarily 
invariant and u(I) = 1, u(£) = 1 if and only if £ is unitary, 
as claimed. □ 

We now show that the unitarity can be used with the average 
gate infidelity to quantify the intermediate regime between in¬ 
coherent and unitary errors. It is useful to define a notion of 
average gate infidelity that has been optimized to remove uni¬ 
tary noise. First recall the definition of average gate infidelity. 


r{£) = 1 - 


dipTr[ip£(ip)]. 


(43) 
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Then for any CPTP channel £, define 

R(£) = min r(V o £ o U) . (44) 

U,VGU{d) 

This quantity can be thought of as the best average gate in¬ 
fidelity that is achievable with perfect unitary control. For 
example, if E is a unitary channel, then R(£) = 0. 

Proposition 8. For any CPTP channel £ with average gate 
infidelity r = r (£) to the identity and R = R(£) as above, 
then the following inequalities hold 

u{£) > [1 - dR/(d - l)] 2 > [1 - dr/(d - l)] 2 . (45) 

The chain of inequalities is saturated if and only if £ has a 
unital block £ u that is a diagonal scalar matrix. 

Proof. Any channel with infidelity r to the identity can be 
written as £ = X — r A where the diagonal entries of A are 
nonnegative and TrA = did + 1). We then have 

d 2 d 2 

ll^ulli > ~ A kkr) 2 = d 2 - 1 - 2rTrA + ^ A 2 kk r 2 , 

k =2 k =2 

with equality if and only if £ u is diagonal. The term A kk 
is uniquely minimized for nonnegative A/,/,, subject to the con¬ 
straint TrA = d{d + 1) by setting Akk = d/{d — 1) (that is, 
by setting all the diagonal entries to be equal). This proves 
the weaker inequality bounding u(£) in terms of r. To get the 
stronger inequality in terms of R{£), we use the unitary in¬ 
variance proven in Proposition 7 and optimize the inequality 
over all unitary channels. □ 

We note that the first inequality in Eq. (45) is saturated at 
1 when the noise channel is unitary, and the chain of inequal¬ 
ities is saturated for depolarizing noise, or depolarizing noise 
composed with amplitude damping. 

An immediate corollary of Proposition 8 is that the unitarity 
can be used to put a lower bound on the best possible average 
infidelity in the presence of perfect unitary control. Rearrang¬ 
ing Eq. (45), we find a lower bound 

^ (1 - y/^M) < R{£) < r{£) ■ (46) 

The unitarity is also closely related to the purity of the 
Jamillkowski state associated to the noise channel. 

Proposition 9. The unitarity is related to the purity of the 
Jamiolkowski state by 

d 2 Tr [J(£)l J(£)\ = S(£) 2 + ||£ sdl || 2 + ||£ n || 2 

+ {d 2 - 1 )u{£) (47) 

where J (£) = {£ (g> X)[<f>] and § = \ J2j,k I jj){kk\. 

Proof. We begin with an alternate representation of the maxi¬ 
mally entangled state $ = d -1 A k (g> Ak- By cycling the 


adjoint channel in the trace, the purity of J(£) becomes 

Tr [J{£)U{£)\ = Tr[$(£ t £ ® !)[$]] 

= ®Z|$) 

= ^^(AjtgiAj |£ f £ ®T\A k ® A k ) 

j,k 

= \^£\ A k)( A i\ A k)- ( 48 ) 

j,k 

Since the A k are a trace orthonormal basis, the last line sim¬ 
plifies to 

Tr[j(£)tj(£)] = ^Tr5 f 5. (49) 

Comparing this expression to the decomposition in Eq. (18) 
and using Theorem 1 completes the proof. □ 

Finally, we give a simple example that shows that the uni¬ 
tarity is not a monotone, in the sense that it can oscillate under 
composition of channels. Consider the two (nearly) dual qubit 
channels, 

£o(p) = Tr(p)|0)(0| and ^(p) = (0|p|0)^ . 

Then the unitarity of both Eq and ^ is zero, while the uni¬ 
tarity of the composed channel d £ 0 £o is 1/12. 

We note that for some restricted classes of channels the uni¬ 
tarity is indeed a monotone. For example, a trivial application 
of von Neumann’s trace inequality shows that if the singular 
values of the unital block are all less than or equal to 1 (which 
holds for all qubit channels and all unital channels), then it is 
a monotone for trace-preserving channels. 

VI. CONCLUSION 

In this paper, we have shown that the coherence of a noisy 
process can be quantified by the unitarity, which corresponds 
to the change in the purity (with the identity components sub¬ 
tracted off) averaged over pure states. We have presented a 
protocol for efficiently estimating the unitarity of the average 
noise in the implementation of a unitary 2-design. 

We have also proven that the unitarity is 1 if and only if the 
noise source is unitary and provided a tight lower bound for 
the unitarity in terms of the infidelity (which can be estimated 
using randomized benchmarking [8]). This allows the inter¬ 
mediate regime between fully incoherent and unitary errors 
to be quantified, potentially allowing for improved bounds on 
the worst-case error. We have also shown that the unitarity 
provides a lower bound on the best achievable gate infidelity 
assuming perfect unitary control. 

Our present results also have direct implications for the loss 
protocol when applied to a unitary 2-design, since the vari¬ 
ance over random sequences of fixed length for the protocol 
in Ref. [16] is 

¥ j (Q j )=E j (Q|)^[E j (Q j )] 2 , (50) 




9 


which decays faster with m for fixed S(£) if the unitarity is 
smaller (and hence the two decay rates in the fit curve for de¬ 
termining the unitarity, A±, are smaller). A lower variance 
over sequences allows a more precise estimation of the av¬ 
erage incoherent survival probability for a fixed number of 
experiments. Similar implications may also hold for standard 
randomized benchmarking since u(£) can easily be seen to be 
one of the eigenvalues of the averaged operator in Ref. [13] 
that determines the variance and is precisely the eigenvalue 
that determines the asymptotic variance. However, in order 
to establish a concrete bound, it would have to be shown that 
u(£) is in fact the largest eigenvalue. 

There are four important open problems raised by this 
work. First, while the unitarity is a monotone for unital noise, 
it is not a monotone for trace-decreasing noise. We leave open 
the problem of finding necessary and sufficient conditions for 
when u(£) is a monotone, or finding other quantities that are 
monotonic in general. 

Second, our protocol characterizes the unitarity of the aver¬ 
age noise, but does not characterize the unitarity of the errors 
in the individual gate. While a variant of interleaved random¬ 
ized benchmarking [25] should hold for the current protocol, 
obtaining reasonable bounds on the unitarity of the individual 
error is an open problem. 

Third, the signal for our protocol is substantially improved 
by the purity measurement, but the method of performing the 
purity measurement via measuring Pauli operators is not scal¬ 
able beyond a handful of qubits because of the exponential 


size of the Pauli group on n qubits. Moreover, measuring any 
single Pauli operator will in general give a small signal as the 
number of qubits grows, since we do not perform an inver¬ 
sion step. Directly using the SWAP operation on two copies 
of the system running in parallel is a mathematical solution, 
but the extra resources required to implement this might be 
prohibitive and an analysis of the role of crosstalk and corre¬ 
lations would be required to justify this idea. Thus, identifying 
efficient measurements that give a good signal on multi-qubit 
systems remains an open problem. 

Finally, a pressing open problem identified in this paper is 
to obtain an improved bound on the worst-case error in terms 
of both the infidelity and the unitarity. Such a bound would 
substantially reduce the effort required to certify that an ex¬ 
perimental implementation is near (or below) the threshold 
for fault-tolerant quantum computation. 
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